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Abstract — Polar codes, introduced recently by Arikan, are the 
first family of codes Itnown to achieve capacity of symmetric 
channels using a low complexity successive cancellation decoder. 
Although these codes, combined with successive cancellation, are 
optimal in this respect, their finite-length performance is not 
record breaking. We discuss several techniques through which 
their finite-length performance can be improved. We also study 
the performance of these codes in the context of source coding, 
both lossless and lossy, in the single-user context as well as for 
distributed applications. 

I. Introduction 

Polar codes, recendy introduced by Arikan in [1], are 
the first provably capacity achieving family of codes for 
arbitrary symmetric binary-input discrete memoryless chan- 
nels (B-DMC) with low encoding and decoding complexity. 
The construction of polar codes is based on the following 
observation: Let G2 = [ } ? ] ■ Apply the transform Gf " 
(where denotes the 71*'* Kronecker power) to a block 

of = 2" bits and transmit the output through independent 
copies of a symmetric B-DMC, call it W. As n grows large, 
the channels seen by individual bits (suitably defined in [1]) 
start polarizing: they approach either a noiseless channel or a 
pure-noise channel, where the fraction of channels becoming 
noiseless is close to the capacity I{W). In the following, let 
u denote the vector (ito, . . . , ujv-i) and x denote the vector 
ixo,...,XN-i). Let ^ : {0,...,A^- 1} ^ {0,...,7V- 1} 
be the permutation such that if the n-bit binary representation 
of i is bn-i ■ ■ - bo, then 7r(i) = bo . . . bn-i (we call this a bit- 
reversal). Let wt(i) denote the number of ones in the binary 
expansion of i. 

A. Construction of Polar Codes 

The channel polarization phenomenon suggests to use the 
noiseless channels for transmitting information while fixing 
the symbols transmitted through the noisy ones to a value 
known both to sender as well as receiver. For symmetric 
channels we can assume without loss of generality that the 
fixed positions are set to 0. Since the fraction of channels 
becoming noiseless tends to I{W), this scheme achieves the 
capacity of the channel. 

In [1] the following alternative interpretation was men- 
tioned; the above procedure can be seen as transmitting a 
codeword and decoding at the receiver with a successive 
cancellation (SC) decoding strategy. The specific code which 
is constructed can be seen as a generalization of the Reed- 
Muller (RM) codes. Let us briefly discuss the construction of 



RM codes. We follow the lead of [2] in which the Kronecker 
product is used. RM codes are specified by the two parameters 
n and r and the code is denoted by RM(n, r). An RM(7i, r) 
code has block length 2" and rate ^ J2i=o {!)■ code is 
defined through its generator matrix as follows. Compute the 
Kronecker product Gf". This gives a 2" x 2" matrix. Label 
the rows of this matrix as 0, . . . , 2" — L One can check that 
the weight of the ith row of this matrix is equal to 2"*''^ The 
generator matrix of the code RM{n, r) consists of all the rows 
of Gf"" which have weight at least 2"^''. There are exactly 
X]i=o Ci) ^^^^ rows. An equivalent way of expressing this 
is to say that the codewords are of the form x = uGf", 
where the components Ui of u corresponding to the rows 
of Gf" of weight less than 2""'' are fixed to and the 
remaining components contain the "information." Polar codes 
differ from RM codes only in the choice of generator vectors 
u, i.e., in the choice of which components of u are set to 0. 
Unlike RM codes, these codes are defined for any dimension 
1 < /c < 2". The choice of the generator vectors, as explained 
in [1], is rather complicated; we therefore do not discuss it 
here. Following Arikan, call those components u; of u which 
are set to "frozen," and the remaining ones "information" 
bits. Let the set of frozen bits be denoted by F and the set of 
information bits be denoted by I. A polar code is then defined 
as the set of codewords of the form x = uGf ", where the bits 
i E F are fixed to 0. 

Lemma 1 (RM Code as Polar Code): A RM(n, r) is a po- 
lar code of length 2" with F = {uj : wt(i) < r}. 

B. Performance under Successive Cancellation Decoding 

In [1] Arikan considers a low complexity SC decoder. We 
briefly describe the decoding procedure here. The bits are 
decoded in the order 7r(0), . . . ,7r(A^ — 1). Let the estimates 
of the bits be denoted by mq, ■ • . , un-i- If a bit Ui is frozen 
then Ui = 0. Otherwise the decoding rule is the following: 
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otherwise. 



> 1, 



Using the factor graph representation between u and x 
shown in Figure EJa), Arikan showed that this decoder can 
be implemented with O(A^logA^) complexity. 

Theorem 2: [3] Let W be any symmetric B-DMC with 
I{W) > 0. Let R < I{W) and /3 < i be fixed. Then for 
= 2", 71 > 0, the probability of error for polar coding 



under SC decoding at block length N and rate R satisfies 

P,{N,R)=o{2-^'). 

C. Optimality of the Exponent 

The following lemma characterizes the minimum distance 
of a polar code. 

Lemma 3 (Minimum Distance of Polar Codes): Let / be 
the set of information bits of a polar code C. The minimum 
distance of the code is given by dniin(C) = minig/ 2"*(*'. 

Proof: Let Wmin = miiiig/ wt(z). Clearly, d^in cannot be 
larger than the minimum weight of the rows of the generator 
matrix. Therefore, dmin < 2'"»"'". On the other hand, by adding 
some extra rows to the generator matrix we cannot increase the 
minimum distance. In particular, add all the rows of Gf" with 
weight at least 2"'"'°. The resulting code is RM(n, n — Wniin)- 
It is well known that rfmin(RM(n, r)) = 2""'' [2]. Therefore, 

dmin > rfmin(RM(n, n - Wmi„)) = 2'"™. ■ 

We conclude that for any given rate R, if the information bits 
are picked according to their weight (RM rule), i.e., the 2"i? 
vectors of largest weight, the resulting code has the largest 
possible minimum distance. The following lemma gives a 
bound on the best possible minimum distance for any non- 
trivial rate. 

Lemma 4: For any rate i? > and any choice of infor- 
mation bits, the minimum distance of a code of length 2" is 
bounded as dmm < 2^+^^^ for n > no{R) and a constant 
c = c{R). 

Proof: Lemma [3] implies that dmin is maximized by 
choosing the frozen bits according to the RM rule. The 
matrix Gf " has (") rows of weight 2*. Therefore, dmin < 

S'^ : ELfc+i (■) < 2"i? < EL. (■)• For R>h more 
than half of the rows are in the generator matrix. Therefore, 
there is at least one row with weight less than or equal to 
2l^^T. Consider therefore an R in the range (0,1/2]. Using 
Stirling's approximation, one can show that for any i? > 
at least one row of the generator matrix has weight in the 
range [2^5 T^'^^/", 2rtl+'=v^]. Using Lemma [3] we conclude 
the result. ■ 

Theorem 5: Let R > and /3 > ^ be fixed. For any 
symmetric B-DMC W, and N = 2", n > n{(3, R, W), the 
probability of error for polar coding under MAP decoding at 
block length N and rate R satisfies Pe{N, R) > 2"^" 

Proof: For a code with minimum distance dmin, the 
block error probability is lower bounded by 2^^''"'" for some 
positive constant K, which only depends on the channel. 
This is easily seen by considering a genie decoder; the genie 
provides the correct value of all bits except those which differ 
between the actually transmitted codeword and its minimum 
distance cousin. Lemma |4] implies that for any R > 0, for 
n large enough, dniin < -^^^^ for any f3 > ^. Therefore 
Pe{N, R) > 2-^" for any /3 > i. ■ 

This combined with Theorem|2] implies that the SC decoder 
achieves performance comparable to the MAP decoder in 
terms of the order of the exponent. 



II. Performance under Belief Propagation 

Theorem |2] does not state what lengths are needed in order 
to achieve the promised rapid decay in the error probabil- 
ity nor does it specify the involved constants. Indeed, for 
moderate lengths polar codes under SC decoding are not 
record breaking. In this section we show various ways to 
improve the performance of polar codes by considering belief 
propagation (BP) decoding. BP was already used in [4] to 
compare the performance of polar codes based on Arikan's 
rule and RM rule. For all the simulation points in the plots 
the 95% confidence intervals are shown. In most cases these 
confidence intervals are smaller than the point size and are 
therefore not visible. 

A. Successive Decoding as a Particular Instance of BP 

For communication over a binary erasure channel (BEC) 
one can easily show the following. 

Lemma 6: Decoding the bit [/, with the SC decoder is 
equivalent to applying BP with the knowledge of Uq, ■ ■ ■ , Ui-i 
and all other bits unknown (and a uniform prior on them). 

We conclude that if we use a standard BP algorithm (such 
a decoder has access also to the information provided by the 
frozen bits belonging to [/;+!, . . . , Um-i) then it is in general 
strictly better than a SC decoder. Indeed, it is not hard to con- 
struct explicit examples of codewords and erasure patterns to 
see that this inclusion is strict (BP decoder succeeds but the SC 
decoder does not). Figure[3]shows the simulation results for the 
SC, BP and the MAP decoders when transmission takes place 
over the BEC. As we can see from these simulation results, the 
performance of the BP decoder lies roughly half way between 
that of the SC decoder and that of the MAP decoder. For the 
BEC the scheduling of the individual messages is irrelevant to 
the performance as long as each edge is updated until a fixed 
point has been reached. For general B-DMCs the performance 
relies heavily on the specific schedule. We found empirically 
that a good performance can be achieved by the following 
schedule. Update the messages of each of the n sections of the 
trellis from right to left and then from left to right and so on. 
Each section consists of a collection of Z shaped sub-graphs. 
We first update the lower horizontal edge, then the diagonal 
edge, and, finally, the upper horizontal edge of each of these 
Z sections. In this schedule the information is spread from 
the variables belonging to one level to its neighboring level. 
Figure [T] shows the simulation results for the SC decoder and 
the BP decoder over the binary input additive white Gaussian 
noise channel (BAWGNC) of capacity i. Again, we can see a 
marked improvement of the BP decoder over the SC decoder 

B. Overcomplete Representation: Redundant Trellises 

For the polar code of length 2^ one can check that all the 
three trellises shown in Figure |2] are valid representations. In 
fact, for a code of block length 2", there exist n\ different 
representations obtained by different permutations of the n 
layers of connections. Therefore, we can connect the vectors x 
and u with any number of these representations and this results 



Pb 

m 

10' 

10-- 
10- 



0.35 

Pb 
lO'- 
lO-' 
10- 
10" 



0.40 



0.45 



Pb 
10- 
lO' 
10'- 
10- 

( 

Pb 
W 

m' 

10" 



0.35 



0.40 



0.45 



Pb 
10" 
10"' 
10"- 
10" 

C 

Pb 
10" 
10"- 
10"- 
10" 



0.35 



0.40 



0.45 



Pb 
10"- 
10"' 
10"- 
10"' 



0.35 



Pb 
10" 
10"' 
10"- 
10"' 



0.40 



0.45 



0.35 



0.40 



0.45 



0.35 



0.40 



0.45 



0.35 



0.40 



0.45 



0.35 



0.40 



0.45 



Fig. 1. Comparison of (i) SC and (ii) BP decoder in terms of block eiTor 
probability, when transmission takes place over the BAWGNC(cr = 0.97865). 
The performance curves are shown for n = 10 (top left), 11 (top right), 12 
(bottom left), and 13 (bottom light). 




Fig. 2. Figure (a) shows the trellis representation used by Arikan. Figures 
(b) and (c) show equivalent representations obtained by cyclic shifts of the 3 
sections of trellis (a). 

in an overcomplete representation (similar to the concept used 
when computing the stopping redundancy of a code [5]). For 
the BEC any such overcomplete representation only improves 
the performance of the BP decoder [5]. Further, the decod- 
ing complexity scales linearly with the number of different 
representations used. Keeping the complexity in mind, instead 
of considering all the 7i\ factorial trellises, we use only the 
n trellises obtained by cyclic shifts (e.g., see Figure |2]). The 
complexity of this algorithm is C?(iV(log 7V)^) as compared to 
0{N \ogN) of the SC decoder and BP over one trellis. The 
performance of the BP decoder is improved significantly by 
using this overcomplete representation as shown in Figure |3] 
We leave a systematic investigation of good schedules and 
choices of overcomplete representations for general symmetric 
channels as an interesting open problem. 

C. Choice of Frozen Bits 

For the BP or MAP decoding algorithm the choice of frozen 
bits as given by Arikan is not necessarily optimal. In the 
case of MAP decoding we observe (see Figure |4]i that the 
performance is significantly improved by picking the frozen 
bits according to the RM rule. This is not a coincidence; d^m 
is maximized for this choice. This suggests that there might 
be a rule which is optimized for BP. It is an interesting open 
question to find such a rule. 

III. Source Coding 

In this section we show the performance of polar codes in 
the context of source coding. We consider both lossless and 



Fig. 3. Comparison of (i) SC, (ii) BP, (iii) BP with multiple trellises, and 
(iv) MAP in terms of block error probability, when transmission takes place 
over the BEC(e = ^). The performance curves are shown for n = 10 (top 
left), 11 (top right), 12 (bottom left), 13 (bottom light). 
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Fig. 4. Comparison of block error probability curves under MAP decoding 
between codes picked according to Ankan's rule and RM rale. The perfor- 
mance curves are shown for n = 10 (left) and 11 (right). 

lossy cases and show (in most cases) empirically that they 
achieve the optimal performance in both cases. Let Ber(p) 
denote a Bernoulli source with Pr(l) = p. Let /i2(') denote 
the binary entropy function and ^(') its inverse. 

A. Lossless Source Coding 

1) Single User: The problem of lossless source coding 
of a Ber(p) source can be mapped to the channel coding 
problem over a binary symmetric channel (BSC) as shown 
in [6], [7]. Let x = {xo, . . . ,xi\!-i)'^ be a sequence of 
N i.i.d. realizations of the source. Consider a code of rate 
R represented by the parity check matrix H. The vector 
X is encoded by its syndrome s = Hx. The rate of the 
resulting source code is 1 — R. The decoding problem is 
to estimate x given the syndrome s. This is equivalent to 
estimating a noise vector in the context of channel coding over 
BSC(p). Therefore, if a sequence of codes achieve capacity 
over BSC(p), then the corresponding source codes approach a 
rate /12 (p) with vanishing error probability. 

We conclude that polar codes achieve the Shannon bound 
for lossless compression of a binary memoryless source. 
Moreover, using the trellis of Figure 13 a), we can compute 
the syndrome with complexity 0{N log TV). The source coding 
problem has a considerable advantage compared to the channel 
coding problem. The encoder knows the information seen by 
the decoder (unlike channel coding there is no noise involved 
here). Therefore, the encoder can also decode and check 



whether the decoding is successful or not. In case of failure, 
the encoder can retry the compression procedure by using a 
permutation of the source vector This permutation is fixed a 
priori and is known both to the encoder as well as the decoder. 
In order to completely specify the system, the encoder must 
inform the decoder which permutation was finally used. This 
results in a small loss of rate but it brings down the probability 
of decoding failure. Note that the extra number of bits that 
need to be transmitted grows only logarithmically with the 
number of permutations used, but that the error probability 
decays exponentially as long as the various permuted source 
vectors look like independent source samples. With this trick 
one can make the curves essentially arbitrarily steep with a 
very small loss in rate (see Figure |5]l. 
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Fig. 5. Comparison of SC decoding for a Ber(O.ll) .source with 0,1, and 
2 bits for permutations. The performance curves ai'e shown for n = 10 (left) 
and 11 (right). By increasing the number of permutations the curves can be 
made steeper and steeper. 

2) Slepian-Wolf: Consider two Ber(i) sources X and Y. 
Assume that they are correlated as X — Y @ Z, where 
Z ~ Ber(p). Recall that the Slepian-Wolf rate region is 
the unbounded polytope described by Rx > H{X) ~ 1, 
Ry > H{Y) = 1, Rx+Ry > HiX, r) = 1 + h2{p). The 
points [Rx,Ry) = (1, ^2(p)) and {Rx,Ry) = {h2{p), 1) are 
the so-called corner points. Because of symmetry it suffices to 
show how to achieve one such corner point (say (1, /i2(p)))- 

Let X and y denote N i.i.d. realizations of the two sources. 
The scheme using linear codes is the following: The encoder 
for X transmits x as it is (since H{X) ~ 1, and so no 
compression is necessary). Let H denote the parity-check 
matrix of a code designed for communication over the BSC(p). 
The encoder for Y computes the syndrome s = Hj/ and 
transmits it to the receiver At the receiver we know s = Hy 
and X, therefore we can compute s' = H(a; © y) = Hz. The 
resulting problem of estimating z is equivalent to the lossless 
compression of a Ber(p) discussed in the previous section. 
Therefore, once again polar codes provide an efficient solution. 
The error probability curves under SC decoding are equal to 
the curves shown in the Figure |5] with bits for permutations. 

B. Lossy Source Coding 

1 ) Single User: We do not know of a mapping that converts 
the lossy source coding problem to a channel coding problem. 
However, for the binary erasure source considered in [8], it 
was shown how to construct a "good" source code from a 
"good" (over the BEC) channel code. We briefly describe 
their construction here and show that polar codes achieve the 
optimal rate for zero distortion. 



The source is a sequence of i.i.d. realizations of a random 
variable S taking values in {0,1,*} with Pr(S' = 0) = 
Pr[S = 1) = ^(1 - e),Pr(S' = *) = e. The reconstruction 
alphabet is {0,1} and the distortion function is given by 
d(*,0) = d(*, 1) = 0, d(0, 1) = 1. For zero distortion, the 
rate of the rate-distortion function is given by R{D = 0) = 
1 — e. In [8] it was shown that the dual of a sequence of 
channel codes which achieve the capacity of the BEC(1 — e) 
under BP decoding, achieve the rate-distortion pair for zero 
distortion using a message passing algorithm which they refer 
to as the erasure quantization algorithm. Polar codes achieve 
capacity under SC decoding. For communication over BEC, 
the performance under BP is at least as good as SC. Therefore, 
the dual of the polar codes designed for BEC(1— e) achieve the 
optimum rate for zero distortion using the erasure quantization 
algorithm. Here, we show that the dual polar codes achieve the 
optimum rate for zero distortion even under a suitably defined 
SC decoding algorithm. 

The dual of a polar code is obtained by reversing the roles 
of the check and variable nodes of the trellis in Figure |2j a) and 
reversing the roles of the frozen and free bits. It is easy to see 
that Gf"(^f " = I. This impUes that the dual of a polar code 
is also a polar code. The suitably defined algorithm is given 
by SC decoding in the order 7r(A^ — 1), ... , 7r(0), opposite of 
the original decoding order. We refer to this as the dual order. 

In [1], the probability of erasure for bit u; under SC 
decoding is given by Z„ (1 — e), computed as follows. Let 
the n-bit binary expansion of i be given by &„_i,...,6o. 
Let Zo = 1 — e. The sequence Zi,. . . ,Z„ = Zn\l e) 
is recursively defined as follows: 



72 



1 - (1 - Zk-iY 



if bk-i = 1, 
if fofe-i = 0. 



(1) 



From [3] we know that for any R < e, there exists an uq 
such that for n > uq we can find a set / C {0, . . . , 2" — 1} of 
size 2"R satisfying Y.,^! ^« ^(1 ~ ^) < 2"^" for any /? < 5. 
The set / is used as information bits. The complement of / 
denoted by F, is the set of frozen bits. 

Let Zn (e) denote the probability of erasure for bit Ui of 
the dual code used for BEC(e). One can check that for the 
dual code with the dual order, the bit Ui is equivalent to the 
bit UN~i of the original code with the original decoding order. 
Let Zq = e. The recursive computation for Zn^ is given by 



Z,. 
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1 - (1 - Zk 



if bk- 
if bk- 



Lemma 7 (Duality for BEC): zli\l - e) + Z'il' {e) = 1. 
The proof follows through induction on n and using the 
equations dU and For the dual code the set / (information 
set for BEC(1 — e)) is used as frozen bits and the set F 
is used as information bits. Let x be a sequence 2" source 
realizations. The source vector x needs to be mapped to a 
vector Up (information bits) such that d(u(7f'",a;) = 0. The 
following lemma shows that such a vector up can be found 
using SC decoder with vanishing error probability. 



1=0, 
1 = 1. 



(2) 



Lemma 8: The probability of encoding failure for erasure 
quantization of the source S using the dual of the polar code 
designed for the BEC(1 — e) and SC decoding with dual order 
is bounded as Pe{N) = 0(2-^") for any /3 < i. 

Proof: Let / and F be as defined above. The bits 
belonging to / are already fixed to whereas the bits belonging 
to F are free to be set. Therefore an error can only occur 
if one of the bits belonging to / are set to a wrong value. 
However, if the SC decoding results in an erasure for these 
bits, these bits are also free to be set any value and this results 
in no error Therefore the probability of error can be upper 
bounded by the probability that at least one of the bits in 
/ is not an erasure which in turn can be upper bounded by 
E,e7(l-^^'^(e)) = E.e/^i''(l-£) =0(2-^'), where the 
last equality follows from Theorem |2] ■ 

The fact that the dual code can be successfully applied to 
the erasure source, suggests to extend this construction to more 
general sources. Let us try a similar construction to encode 
the Ber(i) source with the Hamming distortion. To design a 
source code for distortion D we first design a rate 1 — (p) 
polar code for the BSC(p) where p = h^^il — /i2 (£*)). The 
design consists of choosing the generator vectors from the 
rows of Gf" as explained in [1]. The source code is then 
defined by the corresponding dual code with the dual decoding 
order Since the rate of the original code is 1 — h2{p), the 
rate of the dual code is h2{p) = 1 — h2{D). Figure |6] shows 
the rate-distortion performance of these codes for various 
lengths. As the lengths increase, empirically we observe that 
the performance approaches the rate-distortion curve. It is 




0.0 0.2 0.4 0.6 0.8 R 0.0 0.2 0.4 0.6 0.8 -R 



Fig. 6. (left) Rate-Distortion curves for a Ber(O.ll) source. The performance 
curves are shown for n = 9, 11, 13, 15, 17. (right) The dashed line is the 
curve Rwz{D). The solid line is the lower convex envelope of Rwz{D) 
and (0,0.3) for a Ber(|) source with receiver having a side information 
obtained thi'ough a BSC(0.3). The performance curves are shown for n = 
9, 11, 13, 15. 

an interesting and challenging open problem to prove this 
observation rigorously. 

2) Wyner-Ziv: Let X denote a Ber(i) source which we 
want to compress. The source is reconstructed at a receiver 
which has access to a side information Y correlated to X 
as Y ~ X Q) Z with Z ~ Ber(p). Let x and y denote a 
sequence of N i.i.d. realizations of X and Y. Wyner and Ziv 
have shown that the rate distortion curve is given by the lower 
convex envelope of the curve Rwz{D) ~ h2{D * p) — h2{D) 
and the point (0,p), where D *p ~ D{1 — p) + (1 — D)p. As 
discussed in [9], nested linear codes are required to tackle this 
problem. The idea is to partition the codewords of a code Ci 
into cosets of another code C2. The code Ci must be a good 



source code and the code C2 must be a good channel code. 

Using polar codes, we can create this nested structure as 
follows. Let Ci be a source code for distortion D (Bernoulli 
source, Hamming distortion) as described in the previous 
section. Let be the set of frozen (fixed to 0) bits for this 
source code. Using this code we quantize the source vector x. 
Let the resulting vector be x. Note that the vector x is given 
by -uGf where u = (uq, . . . , un~i) is such that u,; = for 
i E Fs and for i G F^', Ui is defined by the source quantization. 

For a sequence of codes achieving the rate-distortion bound, 
it was shown in [10] that the "noise" x ® x added due to 
the quantization is comparable to a Bernoulli noise Ber(L)). 
Therefore, for all practical purposes, the side information at 
the receiver y is statistically equivalent to the output of x 
transmitted through a BSC(Z) * p). Let Fc be the set of 
frozen bits of a channel code for the BSC(D * p). Let the 
encoder transmit the bits up^\p^ to the receiver Since the 
bits belonging to Fg [up J are fixed to zero, the receiver 
now has access to up^ and y. The definition of Fc implies 
that the receiver is now able to decode upo (and hence x) 
with vanishing error probability. To see the nested structure, 
note that the code C2 and its cosets are the different channel 
codes defined by different values of up^\p^ and that these 
codes partition the source code Ci. The capacity achieving 
property of polar codes implies that for N sufficiently large, 
\Fc\ < N{h2{D * p) + S) for any 5 > 0. In addition, if we 
assume that polar codes achieve the rate-distortion bound as 
conjectured in the previous section and Fg C F^ then for 
sufficiently large, the rate required to achieve a distortion D 
is j^\Fc\Fs\ < h2{D*p) - h2{D) + 5 for any 5 > 0. This 
would show that polar codes can be used to efficiently realize 
the Wyner-Ziv scheme. The performance of the above scheme 
for various lengths is shown in Figure |6] 
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